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Abstract 

Background: Confirmation bias is the tendency to acquire or evaluate new information in a way that is consistent with one's 
preexisting beliefs. It is omnipresent in psychology, economics, and even scientific practices. Prior theoretical research of 
this phenomenon has mainly focused on its economic implications possibly missing its potential connections with broader 
notions of cognitive science. 

Methodology/Principal Findings: We formulate a (non-Bayesian) model for revising subjective probabilistic opinion of a 
confirmationally-biased agent in the light of a persuasive opinion. The revision rule ensures that the agent does not react to 
persuasion that is either far from his current opinion or coincides with it. We demonstrate that the model accounts for the 
basic phenomenology of the social judgment theory, and allows to study various phenomena such as cognitive dissonance 
and boomerang effect. The model also displays the order of presentation effect-when consecutively exposed to two 
opinions, the preference is given to the last opinion (recency) or the first opinion (primacy) -and relates recency to 
confirmation bias. Finally, we study the model in the case of repeated persuasion and analyze its convergence properties. 

Conclusions: The standard Bayesian approach to probabilistic opinion revision is inadequate for describing the observed 
phenomenology of persuasion process. The simple non-Bayesian model proposed here does agree with this 
phenomenology and is capable of reproducing a spectrum of effects observed in psychology: primacy-recency 
phenomenon, boomerang effect and cognitive dissonance. We point out several limitations of the model that should 
motivate its future development. 



CrossMark 

dhl fa updtfM 



Citation: Allahverdyan AE, Galstyan A (2014) Opinion Dynamics with Confirmation Bias. PLoS ONE 9(7): e99557. doi:10.1371/journal.pone.0099557 
Editor: Pablo Branas-Garza, Middlesex University London, United Kingdom 
Received December 8, 2013; Accepted May 15, 2014; Published July 9, 2014 

Copyright: © 2014 Allahverdyan, Galstyan. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which 
permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. 

Funding: This research was supported by DARPA grant No. W91 1NF-12-1-0034 and AFOSR MURI grant No. FA9550-1 0-1 -0569. The funders had no role in study 
design, data collection and analysis, decision to publish, or preparation of the manuscript. 

Competing Interests: The authors have declared that no competing interests exist. 

* Email: armen.allahverdyan@gmail.com 



Introduction 

Confirmation bias is the tendency to acquire or process new 
information in a way that confirms one's preconceptions and 
avoids contradiction with prior beliefs [52]. Various manifestations 
of this bias have been reported in cognitive psychology [5,67], 
social psychology [24,54], politics [46] and (media) economics 
[31,51,57,73]. Recent evidence suggests that scientific practices 
too are susceptible to various forms of confirmation bias 
[12,38,43,44,52], even though the imperative of avoiding precisely 
this bias is frequently presented as one of the pillars of the scientific 
method. 

Here we are interested in the opinion revision of an agent V 
who is persuaded (or advised) by another agent Q [10,13,52]. 
(Below we use the terms opinion and belief interchangeably.) We 
follow the known framework for representing uncertain opinions 
of both agents via the subjective probability theory [13]. Within 
this framework, the opinion of an agent about propositions (events) 
is described by probabilities that quantify his degree of confidence 
in the truth of these propositions [13]. As we argue in the next 
section, the standard Bayesian approach to opinion revision is 
inadequate for describing persuasion. Instead, here we study 
confirmationally-biased persuasion within the opinion combina- 
tion approach developed in statistics; see [21,30] for reviews. 

We suggest a set of conditions that model cognitive aspects of 
confirmation bias. Essentially, those conditions formalize the 



intuition that the agent V does not change his opinion if the 
persuasion is either far away or identical with his existing opinion 
[15,60]. We then propose a simple opinion revision rule that 
satisfies those conditions and is consistent with the ordinary 
probability theory. The rule consists of two elementary operations: 
averaging the initial opinion with the persuading opinion via linear 
combination, and then projecting it onto the initial opinion. The 
actual existence of these two operations has an experimental 
support [8,9,18,72,]. 

We demonstrate that the proposed revision rule is consistent 
with the social judgment theory [10], and reproduces the so called 
change-discrepancy relationship [10,35,40,45,69]. Furthermore, the 
well-studied weighted average approach [9,27] for opinion revision is 
shown to be a particular case of our model. 

Our analysis of the revision rule also reveals novel effects. In 
particular, it is shown that within the proposed approach, the 
recency effect is related to confirmation bias. Also, repeated 
persuasions are shown to hold certain monotonicity features, but 
do not obey the law of diminishing returns. We also demonstrate 
that the rule reproduces several basic features of the cognitive 
dissonance phenomenon and predicts new scenarios of its emer- 
gence. Finally, the so called boomerang {backfire) effect can emerge as 
an extreme form of confirmation bias. The effect is given a 
straightforward mathematical description in qualitative agreement 
with experiments. 
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The rest of this paper is organized as follows. In the next section 
we introduce the problem setup and provide a brief survey of 
relevant work, specifically focusing on inadequacy of the standard 
Bayesian approach to opinion revision under persuasion. In the 
third section we define our axioms and introduce the confirma- 
tionally biased opinion revision rule. The fourth section relates our 
setup to the social judgment theory. Next two sections describe 
how our model accounts for two basic phenomena of experimental 
social psychology: opinion change versus discrepancy and the 
order of presentation effect. The seventh section shows how our 
model formalizes features of cognitive dissonance, followed by 
analysis of opinion change under repeated persuasion. Then we 
study the boomerang effect-the agent changes his opinion not towards 
the persuasion, but against it- as a particular case of our approach. 
We summarize and conclude in the last section. 

The Set-Up and Previous Research 

Consider two agents V and Q. They are given an uncertain 
quantity (random variable) X with values k=\,...,N, e.g. 
k= (rain, nor am), if this is a weather forecast. X constitutes the 
state of the world for V and Q. The opinions of the agents are 
quantified via probabilities 

N N 

p = {pk} N k= l &ndq = {q k }^ 1 ,J2Pk=J2l k = l W 

k=\ k=l 

for V and Q respectively. 

Let us now assume that V is persuaded (or advised) by Q. 
(Persuasion and advising are not completely equivalent [71]. 
However, in the context of our discussion it will be useful to 
employ both terms simultaneously stressing their commmon 
aspects.) Throughout this paper we assume that the state of the 
world does not change, and that the agents are aware of this fact. 
Hence, T is going to change his opinion only under influence of 
the opinion of Q, and not due to any additional knowledge about 
X (For more details on this point see [3,41] and the second section 
of File SI.) 

The normative standard for opinion revision is related to the 
Bayesian approach. Below we discuss the main elements of the 
Bayesian approach, and outiine certain limitations that motivates 
the non-Bayesian revision rule suggested in this work. 

Within the Bayesian approach, the agent V treats his own 
probabilistic opinion p = {pk}k=\ as a prior, and the probabilistic 
opinion q = {q k }k = \ °f Q as an evidence [28,30,47]. Next, it is 
assumed that V is endowed with conditional probability densities 
Yl(q\k), which statistically relate q to the world state k. Upon 
receiving the evidence from Q, agent V modifies his opinion from 
p t to p(k\q) via the Bayes rule: 

N 

^=ww£n(#,. (2) 
1=1 

One issue with the Bayesian approach is that the assumption on 
the existence and availability of Yl(q\k) may be too strong 
[13,25,30]. Another issue is that existing empirical evidence 
suggests that people do not behave according to the Bayesian 
approach [13,61], e.g. they demonstrate the order of presentation 
effect, which is generally absent within the Bayesian framework. 

In the context of persuasion, the Bayesian approach (2) has two 
additional (and more serious) drawbacks. To explain the first 
drawback, let us make a generic assumption that there is a unique 



index k for which Tl{q\k) is maximized as a function of k (for a 
given q): Yl(q\k)>Tl{q\k) for k^k. 

Now consider repeated application of (2), which corresponds to 
the usual practice of repeated persuasion under the same opinion q 
of Q. The opinion of the agent then tends to be completely 
polarized, i.e. p k — >1 and >0 for k^k. In the context of 
persuasion or advising, we would rather expect that under 
repeated persuasion the opinion of V will converge to that of Q. 

The second issue is that, according to (2), V will change his 
opinion even if he has the same opinion as Q: p = q. This feature 
may not be realistic: we do not expect V to change his opinion, if 
he is persuaded towards the same opinion he has already. This 
drawback of (2) was noted in [28] . (Ref. [28] offers a modification 
of the Bayesian approach that complies with this point, as shown 
in [28] on one particular example. However, that modification 
betrays the spirit of the normative Bayesianism, because it makes 
conditional probabilities depending on the prior probability.) 

It is worthwhile to note that researchers have studied several 
aspects of confirmation bias by looking at certain deviations from 
the Bayes rule, e.g. when the conditional probability are available, 
but the agent does not apply the proper Bayes rule deviating from 
it in certain aspects [31,51,57,73]. One example of this is when the 
(functional) form of the conditional probability is changed 
depending on the evidence received or on the prior probabilities. 
Another example is when the agent does not employ the full 
available evidence and selects only the evidence that can 
potentially confirm his prior expectations [39,48,67]. More 
generally, one has to differentiate between two aspects of the 
confirmation bias that can be displayed either with respect to 
information acquiring, or information assimilation (or both) [52]. 
Our study will concentrate on information assimilation aspect; 
first, because this aspect is not studied sufficiently well, and second, 
because because it seems to be more directly linked to cognitive 
limitations [52]. We also stress that we focus on the belief revision, 
and not on actions an agent might perform based on those beliefs. 

Opinion Revision Rule 

We propose the following conditions that the opinion revision 
rule should satisfy. 

1. The revised opinion pk ofV is represented as 

Pk = F\p k ,q k ]/ X^/Li^'' 9 ']' ( 3 ) 

where -F[xi,X2] is defined over xie[0,oo) and X2e[0,oo). We 
enlarged the natural range xie[0,l] and X2e[0,l], since below we 
plan to consider probabilities that are not necessarily normalized 
to 1. There are at least two reasons for doing so: First, 
experimental studies of opinion elicitation and revision use more 
general normalizations [8,9]. For example, if the probability is 
elicited in percents, the overall normalization is 100. Second, and 
more importandy, the axioms defining subjective (or logical) 
probabilities leave the overall normalization as a free parameter 
[22]. 

We require that .F[xi,X2] is continuous for xie[0,co) and 
X2e[0,oo) and infinitely differentiable for xie(0,oo) and X2e(0,oo). 
Such (or similar) conditions are needed for features that are 
established for certain limiting values of the arguments of F (cf. (5, 
6)) to hold approximately whenever the arguments are close to 
those limiting values. F can also depend on model parameters, as 
seen below. 

Eq. (3) means that V first evaluates the (non-normalized) weight 
F\pk,qk] for the event k based solely on the values otp t and q & and 
then applies overall normalization. A related feature of (3) is that it 
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is local: assume that iV>3 and only the probability q\ is 
communicated by Q to V. This suffices for V to revise his 
probability from p x to p\, and then adjust other probabilities via 
renormalization: 



pi = F\p l ,q l ]/(F\p u q l ] + 1 -pi), 



Pk =Pkl {F\p\ ,q\] + 1 -p\ )k>2. 



(4) 



Eq. (3) can be considered as a succession of such local processes. 
2. lipk = 0 for some k, then pk = 0: 



F[0,y]=0. 



(5) 



The rationale of this condition is that if V sets the probability of 
a certain event strictly to zero, then he sees logical (or factual) 
reasons for prohibiting the occurrence of this event. Hence V is 
not going to change this zero probability under persuasion. 

3. lfptqk=0 for all k, then pk=Pk'- V cannot be persuaded by 
Q if their opinions have no overlap. 

4. If Q's and V's opinions are identical, then the latter will not 
change his opinion: {Pk}k=l = {<7i}ifc=l (f° r a U ^) leads to 



{Pk}k = i = {Pk}k=i - This can be written as 



-F[x,x] = x. 



(6) 



Conditions 3 and 4 are motivated by experimental results in 
social psychology, which state that people are not persuaded by 
opinions that are either very far, or very close to their initial 
opinion [10,17,69]. 

(Recall that we do not allow the uncertain quantity X to change 
during the persuasion or advising. If such a change is allowed, 4 
may not be natural as the following example shows. Assume that T 
holds a probabilistic opinion (0.1,0.9) on a binary X. Let V learns 
that X changed, but he does not know in which specific way it did. 
Now V meets Q who has the same opinion (0.1,0.9). Provided 
that Q does not echo the opinion of V, the agent V should perhaps 
change his opinion by decreasing the first probability (0.1) towards 
a smaller value, because it is likely that X changed in that 
direction.) 

5. _Fis a homogeneous function of order one: 



F[yx,yy] = yF[x,y] for y >0. 



(7) 



The rationale for this condition comes from the fact that 
(depending on the experimental situation) the subjective proba- 
bility may be expressed not in normalization one (i.e. not with 

2_^k- 1 P k = / yj— i 1 k = 1)' b ut w ' tn a different overall normal- 

ization (e.g. )^k=i Pk = Z^k=i qk = 7 ^ t 8 ' 9 ' 22 ^ cf - 1 In tms 
light, (7) simply states that any choice of the overall normalization 
is consistent with the sought rule provided that it is the same for V 
and Q. Any rescaling of the overall normalization by the factor y 
will rescale the non-normalized probability by the same factor y; 
cf- (7). 

6. Now we assume that the opinion assimilation by V consists of 
two sub-processes. Both are related to heuristics of human 
judgement. 



6.1 V combines his opinion linearly with the opinion of Q 
[8,9,18,29,30]: 



Pk=tPk+(\-()qk, o<e<i, 



(8) 



where £ is a weight. Several mathematical interpretations of the 
weight £ were given in statistics, where (8) emerged as one of the 
basic rules of probabilistic opinion combination [16,29]; see 
section I of File SI. One interpretation suggested by this approach 
is that £ and 1— £ are the probabilities-from the subjective 
viewpoint of V -for, respectively,/) and q to be the true description 
of states of the world [29]: it is not known to V which one of these 
probabilities [p or q) conveys a more accurate reflection of the 
world state. Then {pk}%=i is just the marginal probability for the 
states of the world. There is also an alternative (normative) way of 
deriving (8) from maximization of an average utility that under 
certain natural assumptions can be shown to be the (negated) 
average information loss [16]; see section I of File SI for more 
details. 

Several qualitative factors contribute to the subjective assess- 
ment of £. For instance, one interpretation is to relate £ to 
credibility of Q (as perceived by V): more credible Q leads to a 
larger 1— £ [18]. Several other factors might affect £: egocentric 
attitude of V that tends to discount opinions, simply because they 
do not belong to him; or the fact that V has access to internal 
reasons for choosing his opinion, while he is not aware of the 
internal reasons of Q etc [18]. Taking into account various factors 
that contribute to the interpretation of £, we will treat it as a free 
model parameter. 

6.2 Note that (8) does not satisfy conditions 2 and 3 above. We 
turn to the last ingredient of the sought rule, which, in particular, 
should achieve consistency with conditions 2 and 3. 

Toward this goal, we assume that V projects the linearly 
combined opinion p (see (8)) onto his original opinion p. Owing to 
(3), we write this transformation as 



Pk = <t>\pk,Pk}/ 2^ l=l ^\pi,Pi], 



(9) 



where the function (j) is to be determined. 

The above projection operation relates to trimming [18,72], a 
human cognitive heuristics, where V tends to neglect those aspects 
of Q's opinion that deviate from a certain reference. In the 
simplest case this reference will be the existing opinion of V. 

To make the projection process (more) objective, we shall 
assume that it commutes with the probabilistic revision: whenever 



Pk-- 



VkPk 



-,Pk- 



YkPk 



i=iYiPi Y,i=\YiPi 



\<k<N, 



(10) 



where y k =Pr(...|fc)>0 are certain conditional probabilities, p is 
revised via the same rule (10): 



Pk : - 



<t>\p'k>P'k\ 



Yk<l>\Pk,Pk] 



o'bP'l] T,l=lYi<l>\pi,Pl] 



(11) 



This feature means that the projection is consistent with 
probability theory: it does not matter whether (3) is applied before 
or after (10). 

It is known that (9) together with (10, 11) selects a unique 
function [30]: 
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<t>\Pk,Pk]=Pk pI o<^<i 



(12) 



where fi quantifies the projection strength: for fi = 1 the projection 
is so strong that V does not change his opinion at all 
(conservatism), while for jU->0, V fully accepts p (provided that 
Pk>0 for all k). (The above commutativity is formally valid also for 
fi<0 or fi> 1, but both these cases are in conflict with (5).) In 
particular, £— >0 and fi— >0 is a limiting case of a fully credulous 
agent that blindly follows persuasion provided that all his 
probabilities are non-zero. (For a sufficiendy small fi, a small p k 
is less effective in decreasing the final probability p k ; see (12). This 
is because p^ = e >l]npic tends to zero for a fixed fi and Pk—>0, while 
it tends to one for a fixed p t and p->0. This interpaly between 
p k -*0 and fl->0 is not unnatural, since the initial opinion of a 
credulous agent is expected to be less relevant. The case of 
credulous agent is of an intrinsic interest and it does warrant 
further studies. However, since our main focus is confirmation 
bias, below we set p=l/2 and analyze the opinion dynamics for 
varying e.) 

The final opinion revision rule reads from (12, 8, 9): 



Pk = 



\/pk[£Pk + (i-£)qk] 

Ef=i VPii £ Pi + (!-£)?/] 



0<£<1. 



(13) 



It is seen to satisfy conditions 1-5. 

(Note that the analogue of (11), p' k ccy k p k , q' k ccy k q k does not 
leave invariant the linear function (8). First averaging, 
£Pk + (l-£)qk and then applying p' k ccy k p k , q'kXy k q k is equiva- 
lent to first applying the latter rules and then averaging with a 
different weight e'. This is natural: once e can be (in principle) 
interpreted as a probability it should also change under 
probabilistic revision process.) 

The two processes were applied above in the specific order: first 
averaging (8), and then projection (9). We do not have any strong 
objective justifications for this order, although certain experiments 
on advising indicate on the order that led to (13) [72]. Thus, it is 
not excluded that the two sub-processes can be applied in the 
reverse order: first projection and then averaging. Then instead of 
(13) we get (3) with: 



F\p k ,q k ;e,n]=£p k + (\-()p' 1 k ql 0<(i<l. 



(14) 



Our analysis indicates that both revision rules (13) and (14) 
(taken with p=\/2) produce qualitatively similar results. Hence, 
we focus on (13) for the remainder of this paper. 

Returning to (1), we note that k = x can be a continuous 
variable, if (for example) the forecast concerns the chance of 
having rain or the amount of rain. Then the respective probability 
densities are: 



p(x) and q{x), ^Axp{x)=\^dxq{x) = \. (15) 



Since the revision rule (13) is continuous and differentiable (in 
the sense defined after (3)), it supports a smooth transition between 
discrete probabilities and continuous and differentiable probability 
densities. In particular, (13) can be written directly for densities: for 
p k ~p(x k )dx we obtain from (13) 



p(x)-- 



x /p(x)[ep(x) + (l-e)q(x)] 
J dx' p(x')[ ep(x') + (1 - e) q(x') ] 



(16) 



Social Judgment Theory and Gaussian Opinions 

Opinion latitudes 

Here we discuss our model in the context of the social judgment 
theory [59,10], and consider several basic scenarios of opinion 
change under the rule (16). 

According to the social judgment theory, an agent who is 
exposed to persuasion perceives and evaluates the presented 
information by comparing it with his existing attitudes (opinions). 
The theory further postulates that an attitude is composed of three 
zones, or latitudes: acceptance, non-commitment and rejection [10,59]. 
The opinion that is most acceptable to V, or the anchor, is located 
at the center of the latitude of acceptance. The theory states that 
persuasion does not change the opinion much, if the persuasive 
message is either very close to the anchor or falls within the 
latitude of rejection [10,59]. The social judgment theory is 
popular, but its quantitative modeling has been rather scarce. In 
particular, to the best of our knowledge, there has been no attempt 
to develop a consistent probabilistic framework for the theory. 
(The literature on the social judgment theory offers some formal 
mathematical expressions that could be fitted to experimental data 
[45] . There is also a more quantitative theory [34] whose content 
is briefly reminded in section III of File S 1 .) 

Let us assume that k = x is a continuous variable (cf. (15)) and 
that p(x) and q(x) are Gaussian with mean mi and dispersion Vx 
{a = V,Q): 



p{x) = 



(x-m-p) 1 



(x-mr 



q{x) = 



\/2nv Q 



(17) 



Effectively, Gaussian probabilistic opinions are produced in 
experiments, when the subjects are asked to generate an opinion 
with «95% confidence in a certain interval [18]. Now we can 
identify the anchor with the most probable opinion mx, while v^ 1 
quantifies the opinion uncertainty. 

The latitude of acceptance amounts to opinions not far from the 
anchor, while the latitude of rejection contains close-to-zero 
probability events, since V does not change his opinion on them; 
recall point 2 from the previous section. One can also identify the 
three latitudes with appropriately chosen zones in the distribution. 
For instance, it is plausible to define the latitudes of acceptance 
and rejection by, respectively, the following formulas of the 3(7 rule 
known in statistics 



xe[m P -2^/v^, m x + 2^/v^\, (18) 



xe(— oo, rap- 3 \Ap]U[«p + 3-^/^,00), (19) 

where the latitude of non-commitment contains whatever is left 
out from (18, 19). Recall that the latitudes of acceptance, non- 
commitment and rejection carry (respectively) 95.4, 4.3 and 0.3% 
of probability. 
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While the definitions (18, 19) are to some extent arbitrary, they 
work well with the rule (16), e.g. if the opinions of V and Q overlap 
only within their rejection latitudes, then neither of them can 
effectively change the opinion of another. Also, V is persuaded 
most strongly, if the anchor of the persuasion falls into the non- 
commitment latitude of V. This is seen below when studying 
change-discrepancy relations. 

Weighted average of anchors 

Next, we demonstrate that the main quantitative theory of 
persuasion and opinion change— the weighted average approach 
[9,27] -is a particular case of our model. We assume that the 
opinions p(x) and q{x) are given as 



P(x)=f(x-m v ), p{x) = g(x-m Q ), 

/'(0) = d//dx|. Y= o = g'(0) = 0, 
/"(0)<0, g"(0)<0, 



(20) 



(21) 



where bofh^x) and g(x) have a unique maximum at x = 0. Hence 
p(x) (resp. q(xj) has a single anchor (maximally probable opinion) 
m-p (resp. mg); see (17) for concrete examples. 

If \mp — OTq| is sufficiently small, p(x) given by (20, 16) has a 
single anchor which is shifted towards that of q(x); see Fig. 1(a). We 
now look for the maximum m~ of p(x) by using (20) in (16). We 

neglect factors of order (D[(mj> — mo) 2 /vp] and 
0[(mp — mgf jvg\ and deduce: 



m~= (1 — o.q) m-p + ocq rriQ, 



(1-6)! 



"(0)1 



g(0) 



(1-0 



|g"(0)| , |T(0)| 



g(0) /(0) 



2e' 



g(0) 



(22) 



(23) 



Eq. (22) is the main postulate of the weighted average approach; 
see [9,27] for reviews. Here hq and 1 — ocq are the weights of Q 
and of V, respectively. For the Gaussian case (17), we have 



Furthermore, we have 



Vp 



(24) 



dciQ 
3, 



1-6 



dtXQ 

~d7 



(25) 



Thus, (Zg's dependence on the involved parameters is intuitively 
correct: it increases with the confidence 1 / Vq of Q, and decreases 
with the confidence 1/vp of V. Note also that ag decreases with e. 

Now let p(x) and q(x) (and hence p(x)) have the same maximum 
mp> = mQ, but Vp«vg; see (17). Expanding (16, 17) over v v — vq 
and keeping the first-order term only we get 




Figure 1. Opinions described via Gaussian densities (17). The 

initial opinion of V is described by Gaussian probability density p(x) 
(blue curve) centered at zero; see (17). The opinion of Q amounts to 
Gaussian probability density q(x) (purple curve) centered at a positive 
value. For all three figures continuous density f(x) {J'=p,q,p) were 
approximated by 100 points {f{xi,)} k = l , Xk+i — Xk =0.1. The resulting 
opinion p(x) of V is given by (16) with t = 0.5 (olive curve), (a) The 
opinion of V moves towards that of Q; mp = 0, o-p = \, mq = 1, <rg = 0.5. 
(b) The maximally probable opinion of V is reinforced; «ip = 0, o> = l, 
mg=0, oq = 0.25. (c) The change of the opinion of V is relatively small 
provided that the Gaussian densities overlap only in the region of non- 
commitment; cf. (18), (19). Whenever the densities overlap only within 
the rejection range the difference between p(x) and p(x) is not visible 
by eyes. For example, if p(x) and q{x) are Gaussian with, respectively, 
m-p=— 3, ;«q = 3, Vp = Vg = l, the Hellinger distance (see (30) for 
definition) h\p,q] =0.99 is close to maximally far, while the opinion 
change is small: h]p,p] = 3A$ x 10~ 2 . 
doi:10.1371/journal.pone.0099557.g001 
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1-e 



VQ- 



l + e 



Vp, 



(26) 



where v~is the dispersion of (non-Gaussian) p(x). Eq. (26) implies 



(v~-vp)(v s -v P ) = 



Oe-vp) >0, 



(27) 



i.e. if l/vg>l/vp (resp. l/v g <l/vp), the final opinion of V 
becomes more (resp. less) narrow than his initial opinion. Fig. 1(b) 
shows that (v~— Vp)(vg — v-p) > 0 holds more generally. 

Thus, the weighted average approach is a particular case of our 
model, where the agent V is persuaded by a slightly different 
opinion. Note also that our model suggests a parameter structure 
of the weighted average approach. 

Opinions and bump-densities 

Gaussian densities (with three latitudes) do correspond to the 
phenomenology of social psychology. However, in certain 
scenarios one might need other forms of densities, e.g., when the 
probability is strictiy zero outside of a finite support. Such opinions 
can be represented by bump-functions 



X(x; b) = M{b)exp 



for \x\ < 1 



(28) 



Opinion Change vs Discrepancy 

One of extensively studied questions in social psychology is how 
the opinion change is related to the discrepancy between the initial 
opinion and the position conveyed by the persuasive message 
[10,35,40,45,69]. Initial studies suggested a linear relationship 
between discrepancy and the opinion change [35], which agreed 
with the prediction of the weighted average model. Indeed, (22) 
yields the following linear relationship between the change in the 
anchor and the initial opinion discrepancy of V and Q: 



m~—mp = ocq (rriQ—m-p). 



(29) 



However, consequent experiments revealed that the linear 
regime is restricted to small discrepancies only and that the actual 
behavior of the opinion change as a function of the discrepancy is 
non-monotonic: the opinion change reaches its maximal value at 
some discrepancy and decreases afterward [10,40,45,69]. 

To address this issue within our model, we need to define 
distance h[p,q] between two probability densities p(x) and q{x). 
Several such distances are known and standardly employed [32]. 
Here we select the Hellinger distance (metric) 



h]p,q] i 



1 

72 



1/2 



(30) 



= 0 for x< — 1 and x> 1. 

where b>0 is a parameter, J\f(b) is the normalization and the 
support of the bump function was chosen to be [—1,1] for 
concretness. The advantage of the bump function that is infinitely 
differentiable despite of having a finite support. 

For sufficiently large b, x( x ; b) is close to a Gaussian, while for 
small b, x(x; b) represents an opinion that is (nearly) homogeneous 
on the interval [—1,1]; see Fig. 2. The opinion revision with bump 
densities follows to the general intuition of rule (16); see Fig. 2. 



dx^/p(x)q(x) 



1 l/ 2 



(31) 



Since \J p(x) is a unit vector in the l-i norm, Eq. (30) relates to 
the Euclidean (fe-norm) distance. It is applicable to discrete 
probabilities by changing the integral in (30, 31) to sum. For 
Gaussian opinions (17) we obtain 



h\p,q] -- 



(VQV-p) 



1/2 



VQ + Vp 



1/2 



(niQ-mp) 1 
e ~ 4(vg + vp) 



1/2 



(32) 




Figure 2. Opinions described via bump densities (28). Blue curve: 
the initial opinion of V given by (28) with b = 1. Purple curve: the 
opinion of Q described by (28) with b = 0.001. Olive curve: the resulting 
opinion of V obtained via (16) with e = 0.5. 
doi:1 0.1 371 /journal.pone.0099557.g002 



A virtue of the Hellinger distance is that it is a measure of 
overlap between the two densities; see (31). We stress, however, 
that there are other well-known distances measures in statistics 
[32]. All results obtained below via the Hellinger distance will be 
checked with one additional metric, the total variation (£i-norm 
distance): 



dx\p(x)-q(x)\. 



(33) 



(To motivate the choice of (33), let us recall two important 
variational features of this distance [32]: (1) &\p,q] = 
maxjj^ji |J n dx(p(x) — q(x)) \ . (2) Define two (generally dependent) 
random variables X, Y with joint probability density g(x,y) such 
that J dxg(x,y) = q(y), J dy g(x,y) =p(x). Now it holds that 
5\p,q] = min[Pr(l- where Vr{X ^ Y) = 1 - J dx g(x,x), 

and the minimization is taken over all g(x,y) with fixed marginals 
equal to p(x) and q{y), respectively.) 
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e 



C 
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m 

Figure 3. Opinion change versus discrepancy, (a) The opinion 
change is quantified via the Hellinger distance h = h[p,p] between the 
old and new opinion of V (blue curves); see (30) for the definition. For 
comparison we also include the total variance distance b = b\p,p\ 
(purple curves); see (33). These two distances are plotted versus the 
discrepancy m = \m-p — mg\. The initial opinion of the agent V is 
Gaussian with mp = 0 and v-p = 1; see (17). The opinion of Q is Gaussian 
with niQ = m and Vg = l. Thus m quantifies the initial distance between 
the opinions of V and Q. The final opinion p(x) is given by (13). 
Different curves correspond to different e. Blue curves: h(m)=h\p,p] for 
t = 0.1 (upper curve) and f = 0.5 (lower curve). Purple curves: 
d(m) = S\p,p] for 6 = 0.1 (upper curve) and e = 0.5 (lower curve). The 
maximum of h(m) (S(m)) is reached at m c h (m c i). (b) m e i, (m c g) is the 
point where h(m) (<5(m)) achieves its maximum as a function of m. Blues 
points: m c k(e) versus f for same parameters as in (a). m c i,(e) grows both 
for e^l and e^0, e.g. w c /,(0.01) = 3.29972, m ( .,,(0.0001) = 4.53052, 
m c /,(0.9) = 2.94933, m,.;,(0.999) = 4.12861. Purple points: m cS {c) versus e 



for same parameters as in (a), (c) The difference of the anchors 
(maximally probable values) Am = m~—m-p versus mg =/« for the initial 

opinions of V and Q given by (17) under mp = 0, vp = l, mq =m and 
Vg=l. The final opinion p(x) of V (and its maximally probable value 
m~) if found from (13) under t = 0.1 (black points), c = 0.25 (blue points) 

and £ = 0.5 (red points). 
doi:10.1371/journal.pone.0099557.g003 

The opinion change is characterized by the Hellinger distance 
li[p,p] between the initial and final opinion of V, while the 
discrepancy is quantified by the Hellinger distance h\p,q\ between 
the initial opinion of V and the persuading opinion. For 
concreteness we assume that the opinion strengths 1/vp and 
1/vq are fixed. Then h\p,q\ reduces to the distance m = \mp — itiq] 
between the anchors (peaks of p(x) and q(xj); see (32). 

Fig. 3(a) shows that the change h[p,p] is maximal at m = m c f l ; it 
decreases for m>m c i l , since the densities of V and Q have a 
smaller overlap. The same behavior is shown by the total variation 
<5[/?,/>] that maximizes at m = m c s; see Fig. 3(a). 

The dependence of m c i, (and of m c s) on £ is also non- 
monotonic; Fig. 3(b). This is a new prediction of the model. Also, 
m c f, and m c $ are located within the latitude of non-commitment of 
V (this statement does not apply to m c i,, when e is close to 1 or 0); 
cf. (18, 19). This point agrees with experiments [10,69]. 

Note that experiments in social psychology are typically carried 
out by asking the subjects to express one preferred opinion under 
given experimental conditions [10,35,40,45,69]. It is this single 
opinion that is supposed to change under persuasion. It seems 
reasonable to relate this single opinion to the maximally probable 
one (anchor) in the probabilistic set-up. Thus, in addition to 
calculating distances, we show in Fig. 3(c) how the final anchor m~ 

of V deviates from his initial anchor m-p. 

Fig. 3(c) shows that for e>0.25, the behavior of Am = \m — m-p\ 

as a function of w = |mp — mg| has an inverted-U shape, as 
expected. It is seen that Am saturates to zero much faster 
compared to the distance h\pj>\. In other words, the full 
probability p keeps changing even when the anchor does not 
show any change; cf. Fig. 3(c) with Fig. 3(a). 

A curious phenomenon occurs for a sufficiently small e; see 
Fig. 3(c) with e = 0.1. Here Am drops suddenly to a small value 
when m passes certain crticial point; Fig. 3(c). The mechanism 
behind this sudden change is as follows: when the main peak of/)(x) 
shifts towards fflg.a second, sub-dominant peak of/>(.x;) appears at 
a value smaller than m-p. This second peak grows with m and at 
some critical value it overcomes the first peak, leading to a 
bistability region and an abrupt change of Am. The latter arises 
due to a subtle interplay between the high credibility of Q (as 
expressed by a relatively small value of e) and sufficiently large 
discrepancy between V and Q (as expressed by a relatively large 
value of m). Recall, however, that the distance h\p,p] calculated via 
the full probability does not show any abrupt change. 

The abrupt change of Am is widely discussed (and experimen- 
tally confirmed) in the attitude change literature; see [49] for a 
recent review. There the control variables for the attitude change- 
information and involvement [49] -differ from e and m. However, 
one notes that the weight e can be related to the involvement: 
more V is involved into his existing attitude, larger is £, while the 
discrepancy m connects to the (new) information contained in the 
persuasion (m = 0 naturally means zero information). 

Let us finally consider a scenario where the change-discrepancy 
relationship is monotonic. It is realized for m-p = ntQ (coinciding 
anchors), where the distance (32) between p(x) and q(x) is controlled 
by Vq (for a fixed Vp). In this case, vthe change h\p,p] is a 
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monotonic function of discrepancy h[p,q\: a larger discrepancy 
produces larger change. This example is interesting, but we are 
not aware of experiments that have studied the change-discrep- 
ancy relation in the case of two identical anchors. 

Order of Presentation 

Recency versus primacy 

When an agent is consecutively presented with two persuasive 
opinions, his final opinion is sensitive to the order of presentation 
[10,13,25,34,35,50,52]. While the existence of this effect is largely 
established, its direction is a more convoluted matter. (Note that 
the order of presentation effect is not predicted by the Bayesian 
approach; see (2).) Some studies suggest that the first opinion 
matters more (primacy effect), whereas other studies advocate that 
the last interaction is more important (recency effect). While it is 
not completely clear which experimentally (un)controlled factors 
are responsible for primacy and recency, there is a widespread 
tendency of relating the primacy effect to confirmation bias 
[13,52]. This relation involves a qualitative argument that we 
scrutinize below. 

We now define the order of presentation effect in our situation. 
The agent V interacts first with Q (with probability density q(x)), 
then with Q' with probability density q'(x). To ensure that we 
compare only the order of Q and Q' and not different magnitudes 
of influences coming from them, we take both interactions to have 
the same parameter 0 < £ < 1 . Moreover, we make Q and Q' 
symmetric with respect to each other and with respect to V, e.g. if 
p(x), q(x) and q'(x) are given by (17) we assume 

Vq' = v q , mQ'—m v =m v — m Q . (34) 

We would like to know whether the final opinion p{x\q,q') ofV 
is closer to q(xj (primacy) or to q'(x) (recency). 

In the present model (and for 0<£<1), the final opinion 
p(x\q,q'} is always closer to the last opinion q'(x), both in terms of 
maximally probable value and distance. In other words, the model 
unequivocally predicts the recency effect. In terms of the Hellinger 
distance (30) 




-4-2 0 2 4 



Figure 4. Order of presentation effect. Blue curve: The initial 
opinion of V is described by Gaussian probability density p(x) with 
m-p = 0 and v-p = 1; see (17). Purple (resp. olive) curve: the initial opinion 
of Q (resp. Q') are given by (17) with mg = 1.5 (resp. nig = — 1.5) and 
vg=0.5 (resp. vq =0.5). Green curve: the resulting opinion of V after 
interacting first with Q and then with Q'. Both interactions use c = 0.5. 
The final opinion of V is inclined to the most recent opinion (that of Q') 
both with respect to its maximally probable value and distance. The 
final opinion of V has a larger width than the initial one. 
doi:10.1371/journal.pone.0099557.g004 



interaction with Q' should influences Vs opinion less, due to a 
smaller overlap between the opinions of Q' and V before the 
second interaction. In fact, this is the standard argument that 
relates primacy effect to the confirmation bias [13,52]: the first 
interaction shapes the opinion of V and makes him confirmation- 
ally biased against the second opinion. This argument does not 
apply to the present model due to the following reason: even 
though the first interaction shifts Vs anchor towards Q's opinion, 
it also deforms the shape of the opinion; see Fig. 1(a). And the 
deformation produced by our revision rule happens to favor the 
second interaction more. 

To get a deeper understanding of the recency effect, let us 
expand (13) for small rj = 1— e: 



h\p{x\q,q'),q'] <h\p(x\q,q'),q]. (35) 



Pk=Pk+j{qk-Pk) 



See Fig. 4 for an example (In our model primacy effect exists in rj 2 , \S~^ ill ~Pl) 2 fl[ 31 

: boomerang regime £>1; see below.) a ^ >k ' £ 1 n , ^ >' 



the boomerang regime £ > 1 ; see below. 

To illustrate (35) analytically on a specific example, consider the 
following (binary) probabilistic opinion of V, Q and Q' 



I 



Pi 



If now V interacts with an agent Q' having opinion q', the 
p = {\/2,\/2), q=(0,\), q' = (\fi). (36) resulting opinion p(q,q') reads from (37): 



V is completely ignorant about the value of the binary variable, 
while Q and Q' are fully convinced in their opposite beliefs. If V 
interacts first with Q and then with Q' (both interactions are given 

by (13) with £= -), the opinion of V becomes (0.52727, 0.47273). 

This is closer to the last opinion (that of Q). 

The predicted recency effect in our model seems rather 
counterintuitive. Indeed, since the first interaction shifts the 
opinion of V towards that of Q, one would think that the second 



PlM4)=Pk 



~{<ik-Pk)+-z(Pk-y)y. 

2 0 , 



; — (q'l-pif 

Pi 
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+ j(Pk-qk) + 0[r, 3 }. (38) 

Hence in this limit pk(q,q')—pk(q',q) depends only on q't — qk 
(and not e.g. on qi£k)'- 

Pk(q,q') -Pk(q\q) = I 2 [q'k ~ q k ] /4 + Otf] . (39) 

It is seen that the more probable persuasive opinion (e.g. the 
opinion of Q' if q'k> q k ) changes the opinion of V if it comes later. 
This implies the recency effect. Indeed, due to symmetry 
conditions for checking the order of presentation effect we can 
also look at h\p(q,q'),q]—h[p(q',q),q]. Using (39) we get for this 

quantity: z , fe-gfc]vVM:> 0 , again due to 

symmetry conditions. 

Note that this argument on recency directly extends to more 
general situations, where the agent is exposed to different opinions 
multiple times. For instance, consider an exposure sequence 
qqq' q' and its reverse q' q' q q. It can be shown that the model 
predicts a recency effect in this scenario as well. For this case, we 
get instead of (39): p k (q,q')-p k (q',q) = ri 2 [q' k -q k ]+0[ri 3 ]. 

Note that the primacy-recency effect is only one (though 
important!) instance of contextual and non-commutative phenom- 
ena in psychology; see [11,66] and references therein. Hence in 
section IV of File SI we study a related (though somewhat less 
interesting) order of presentation effect, while below we discuss our 
findings in the context of experimental results. 

Experimental studies of order of presentation effect 

We now discuss our findings in this section in the context of 
experimental results on primacy and recency. The latter can be 
roughly divided into several group: persuasion tasks [10,50], 
symbol recalling [70], inference tasks [34], and impression 
formation [7,9]. In all those situations one generally observes 
both primacy and recency, though in different proportions and 
under different conditions [34]. Generally, the recency effect is 
observed whenever the retention time (the time between the last 
stimulus and the data taking) is short. If this time is sufficiendy 
long, however, the recency effect changes to the primacy effect 
[10,50,62,70]. The general interpretation of these results is that 
there are two different processes involved, which operate on 
different time-scales. These processes can be conventionally 
related to short-term and long-term memory [70], with the 
primacy effect related to the long-term memory. In our model the 
longer time process is absent. Hence, it is natural that we see only 
the recency effect. The prevalence of recency effects is also seen in 
inference tasks, where the analogue of the short retention time is 
the incremental (step-by-step) opinion revision strategy [34]. 

At this point, let us remind the importance of symmetry 
conditions [such as (34)] for observing a genuine order of 
presentation effect. Indeed, several experimental studies-in 
particular those on impression formation-suggest that the order 
of presentation exists due to different conditions in the first versus 
the second interaction [7,10,34,68,]. (In our context, this means 
different parameters e and (! for each interaction). For instance, 
Refs. [7,10] argue that the primacy effect is frequently caused by 
attention decrement (the first action /interaction gets more 



attention); see also [68] in this context. This effect is trivially 
described by our model, if we assume e to be sufficiendy smaller 
than e'. In related experiments, it was shown that if the attention 
devoted to two interactions is balanced, the recency effect results 
[33], which is consistent with the prediction of our model. 

At the same time, in another interesting study based on 
subjective probability revision, where the authors had taken special 
measures for minimizing the attention decrement, the results 
indicated a primacy effect [55] . 

We close this section by underlining the advantages and 
drawbacks of the present model concerning the primacy-recency 
effect: the main advantage is that it demonstrates the recency effect 
and shows that the well-known argument on relating confirmation 
bias to primacy does not hold generally. The main drawback is 
that the model does not involve processes that are supposedly 
responsible for the experimentally observed interplay between 
recency and primacy. In the concluding section we discuss possible 
extensions of the model that can account for this interplay. 

Cognitive Dissonance 

Consider an agent whose opinion probability density has two 
peaks on widely separated events. Such a density-with the most 
probable opinion being different from the average-is indicative of 
cognitive dissonance, where the agent believes in mutually 
conflicting things [10,26]. 

The main qualitative scenario for the emergence of cognitive 
dissonance is when an agent-who initially holds a probabilistic 
opinion with a single peak-is exposed to a conflicting information 
coming from a sufficiendy credible source [10,26]. We now 
describe this scenario quantitatively. 

Consider again the opinion revision model (16, 17), and assume 
that \m-p — mQ\ is neither very large nor very small (in both these 
cases no serious opinion change is expected), Vq/vj><\ (self- 
assured persuasive opinion) and 0 < e < 1 . In this case, we get two 
peaks (anchors) for the final density p(x). The first peak is very 
close to the initial anchor of p(x), while the second closer to the 
anchor of q(x); see Fig. 5(a). Thus, persuasion from Q whose 
opinion is sufficiendy narrow and is centered sufficiently close (but 
not too close) to Vs. initial anchor leads to cognitive dissonance: V 
holds simultaneously two different anchors, the old one and the 
one induced by Q. 

There are 3 options for reducing cognitive dissonance: 

(i) Increase e making it closer to 1, i.e. making Q less credible; 
see Fig. 5(b). 

(ii) Decrease the width of the initial opinion of V. 

(Hi) Decrease e making Q more credible. In this last case, the 
second peak of p(x) (the one close to the anchor of Q) will be 
dominant; see Fig. 5(c). 

To understand the mechanism of the cognitive dissonance as 
described by this model, let us start from (1) and assume for 
simplicity that the opinion of Q is certain: q k = 0 for k ^ / and 
qi = \. We get from (13): 

Pk = P , k for k # /, (40) 

I -PI +PlJe+{l-e)pr l 



p,^e + (l-e)p, 1 
1 -Pi +piyje + (l-e)pf 1 
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The existence of at least two widely different probable opinions 
is only one aspect of cognitive dissonance [10,26]. Another aspect 
(sometimes called Freud-Festinger's law) is that people tend to 
avoid cognitive dissonance: if in their action they choose one of the 
two options (i.e. one of two peaks of the subjective probability), 
they re-write the history of their opinion revision so that the 
chosen option becomes the most probable one [10,26]. This aspect 
of cognitive dissonance found applications in economics and 
decision making [2,73]. The above points (i)—(ui) provide concrete 
scenarios for a such re-writing. 

Repeated Persuasion 

Here we analyze the opinion dynamics under repeated 
persuasion attempts. Our motivation for studying this problem is 
that repeated exposure to the same opinion is generally believed to 
be more persuasive than a single exposure. 

Under certain conditions {pklk^O, for all k and 1>£>0) we 
show that the target opinion converges to the persuading opinion 
after sufficient number of repetition. Below we also examine how 
exactly this convergence takes place. 

Assume that V revises his opinion repeatedly with the same 
opinion of Q. Eq. (13) implies (1 <k<N) 



[n+l] 
Pk 



P? [ep V k n +{\-e)q k 



(42) 



where 1 >e>0, and n= 1,2,... is the discrete time. For simplicity, 
we assume 




p^ l =p k >0, q k >0ior l<k<N. 



(43) 



Eq. (42) admits only one fixed point q = {qk] k=\- Section VI of 



File S 1 shows that for any convex 



d z /G0 



dy 2 

®\p ln + l] ;q]<<S>\p ln] ;q] 



> 0, function fj) one has 
(44) 



®[P; ?] = Z^k=\ 9kf(Pk/9k)- 



(45) 



0.0 ^ — 1 — ' — 1 — — , — , — l — , — — i . J 

-4-2 0 2 4 

Figure 5. Cognitive dissonance, (a) Blue (resp. purple) curve: the 
initial opinion of agent V (resp. Q) described by probability density p(x) 
(resp. q(x)). Olive curve: the final opinion p(x) ofV as given by (16) with 
e = 0.35. Here p(x) and q(x) are defined by (17) with mp = 0, vp = l, 
mg=2, vq = 0.1. The final opinion develops two peaks of comparable 
height (cognitive dissonance), (b) Avoiding the cognitive dissonance 
due to a larger f = 0.75: the second peak is much smaller (other 
parameters are those of (a)), (c) Avoiding the cognitive dissonance due 
to a smaller e = 0.05: the first peak is much smaller (other parameters 
are those of (a)). 

doi:1 0.1 371 /journal.pone.0099557.g005 

Now pi I p\ > 1 > pk/pk, where k ^ /; hence even if / was on the 
tail of {pk}k= i , it is possible to make it a local (or even the global) 
maximum of {pk\k=\ provided that £ is not close to 1. 



Hence <!>[/?; q] is a Lyapunov function of (42). Since cD[p; q] is a 
convex function of p, <5>\p; q] >/(l) = <]%; q] and 7(1) is the unique 
global minimum of <t>{p; q]. Section VI of File SI shows that the 
equality sign in (45) holds ony for p l " +l] =p [n] . Thus ®\p l " ] ;q] 
monotonically decays to /(l ) = <f>[q; q] showing that the fixed point 
q is globally stable. More generally, the convergence reads: 

p l k^\Pk ] ]<ik/Ylti flAfc where « x> °)= 1 and «°)=°- 

To illustrate (44, 45), one can take f{y)=— yjy. Then (44) 
amounts to decaying Hellinger distance (30). Many other 
reasonable measures of distance are obtained under various 
choices of f. For instance, f(y) = \y — 1| amounts to decaying total 
variation distance (33), while f(j')= — lnj leads to the decaying 
relative entropy (Kullback-Leibler entropy). 

As expected, 0 < e < 1 influences the convergence time. We 
checked that this time is an increasing function of £, as expected. 
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In section VI of File S 1 we also show that the convergence to the 
fixed point respects the Le Chatelier principle known in 
thermodynamics [4]: the probabilities of those events that are 

overestimated from the viewpoint of Q (i.e. >qic) tend to decay 
in the discrete time. Likewise, probabilities of the underestimated 
events (i.e. <qk) increase in time. 

Let us consider the Hellinger distance h„ =h\p l " + between 
two consecutive opinions of V evolving as in (42). It is now possible 
that 



Opinion revision rule 

We now suggest a simple modification of our model that 
accounts for the basic phenomenology of the boomerang effect. 

Recall our discussion (around (8)) of various psychological and 
social factors that can contribute into the weight £. In particular, 
increasing the credibility of Q leads to a larger 1 — e. Imagine now 
that Q has such a low credibility that 

£>1. (47) 



maxi. 



3 [h n ]= h m J=hu 



(46) 



i.e. the largest change of the opinion of V comes not from the first, 
but from one of intermediate persuasions. A simple example of this 
situation is realized for jV = 3, an initial probability vector 
p = (0.98,0.01,0.01) and £ = (0.01,0.01,0.98) in (43). We then 
apply (42) under £ = 0.5. The consecutive Hellinger distances read 
hi = 0.1456</? 2 = 0.1567>/? 3 =0.1295>/i 4 .... Hence the second 
persuasion changes the opinion more than others. For this to hold, 
the initial opinion p of V has to be far from the opinion q of Q. 
Otherwise, we get a more expected behavior h\>h.2>h.T,>hi\... 
meaning that the first persuasion leads to the largest change. 
(The message of (46) is confirmed by using the discrete version 

5\p,q] = — ^2 k \Pk — 1k\ of the distance (33). Define 

d n = 5\p^+^jM\. Then with p = (0.98,0.01,0.01) and 
£ = (0.01,0.01,0.98) we get <5i =0.0834, <5 2 =0.1636, 
<5 3 =0.1717, <5 4 =0.1444.) 

We conclude by stressing that while repeated persuasions drive 
the opinion to its fixed point monotonically in the number of 
repetitions, it is generally not true that the first persuasion causes 
the largest opinion change, i.e. the law of diminishing returns does 
not hold. To obtain the largest opinion change, one should 
carefully choose the number of repetitions. 

Finally, note that the framework of (42) can be applied to 
studying mutual persuasion (consensus reaching). This is described 
in Section VII of File SI; see also [23] in this context. 

Boomerang (Backfire) Effect 

Definition of the effect 

The boomerang or backfire effect refers to the empirical observation 
that sometimes persuasion yields the opposite effect: the persuaded 
agent V moves his opinion away from the opinion of the 
persuading agent, Q, i.e. he enforces his old opinion [53,58,64,69]. 
Early literature on social psychology proposed that the boomerang 
effect may be due to persuading opinions placed in the latitude of 
rejection [69], but this was not confirmed experimentally [40]. 

Experimental studies indicate that the boomerang effect is 
frequently related with opinion formation in an affective state, 
where there are emotional reasons for (not) changing the opinion. 
For example, a clear evidence of the boomerang effect is observed 
when the persuasion contains insulting language [1]. Another 
interesting example is when the subjects had already announced 
their opinion publicly, and were not only reluctant to change it (as 
for the usual conservatism), but even enforced it on the light of the 
contrary evidence [64] (in these experiments, the subjects who did 
not make their opinion public behaved without the boomerang 
effect). A similar situation is realized for voters who decided to 
support a certain candidate. After hearing that the candidate is 
criticized, the voters display a boomerang response to this criticism 
and thereby increase their support [53,58]. 



Recall that £ = 1 means a special point, where no change of 
opinion of V is possible whatsoever; cf. (13). 

After analytical continuation of (13) for £ > 1 , the opinion 
revision rule reads 



Pk 



\/Pk\£Pk + (\-e)qk\ 

Ef=i \Zp/kp/+(i-<0?/l' 



(48) 



with obvious generalization to probability densities. The 
absolute values in (48) are necessary to ensure the positivity of 
probabilities. 

It is possible to derive (rather simply postulate) (48). Toward this 
end, let us return to the point 6.1 and (8). During the opinion 
combination step, V forms epk + (l —t)qk which in view of £> 1 
can take negative values and hence is a signed measure. Signed 
measures have all formal features of probability besides positivity 
[6,14,19,65]; see section V of File SI for details. There is no 
generally accepted probabilistic interpretation of signed measures, 
but in section V of File SI we make a step towards such an 
interpretaion. There we propose to look at a signed measure as a 
partial expectation value defined via joint probability of the 
world's states and certain hidden degrees of freedom (e.g. 
emotional states). After plausible assumptions, the marginal 
probability of the world's states is deduced to be 




Figure 6. Opinion change in the boomerang regime. Blue (resp. 
purple) curve: the initial opinion of agent V (resp. Q) described by 
probability density p(x) (resp. g(x)). Olive curve: the final opinion p(x) of 
V given by (16) with e = 2. Here p(x) and q(x) are given by (17) with 
m-p = 0 and vp = mg = i'g = 1. The anchor (maximally probable opinion) 
of V not only moves away from the anchor of Q; but it is also enhanced: 
the (biggest) peak of j>(x) is larger than that of p(x). The second (smaller) 
peak of p(x) arises because the initial probability of V located to the 
right from the anchor rag of Q, moves away from mg; p(xj gets a local 
minimum close to mg. 
doi:10.1371/journal.pone.0099557.g006 
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Figure 7. Order of presentation effect in the boomerang 
regime. The same as in Fig. 4 but for 6 = 1.5 (boomerang regime). 
Now the final opinion of V is inclined to the first opinion (that of Q) 
with respect to the distance. The initial maximally probable opinion of 
V is still maximally probable. Moreover, its probability has increased 
and the width around it has decreased. The final opinion has 3 peaks. 
doi:1 0.1 371 /journal.pone.0099557.g007 



Figure 8. Illustration of the order of presentation effect in the 



gle) 

boomerang regime. p x (q,q')= 

g(c) + 1 

doi:10.1371/journal.pone.0099557.g008 



given by (50, 51) versus f. 



Pk = \epk + (\-e)q k \/J2 \ep k + ( 1 - e)q k \, (49) 



p{qrf) -- 



l 



g{e) + V g(e)+lj' 



(50) 



We obtain (48) after applying (9, 10) to (49). 



g(e) = 



v^4-(l-£)^2^ 



(51) 



Scenarios of opinion change 

According to (47, 48) those opinions of V which are within the 
overlap between p and q (i.e PkHk 5^0) get their probability 
decreased if Pk/qk x ( e — 1 )/e < 1 , i-e. if the initial p k was already 
smaller than q ^ In this sense, V moves his opinion away from that 
of Q. Hence for continuous densities p(x) and q(x) there will be a 
point x 0 , where p(xq) is close to 0. This point is seen in Figs. 6 and 
7. 

Fig. 6 illustrates the shape of p(x) produced by (48) for initially 
Gaussian opinions (17) of V and Q. It is seen that Vs anchor 
moves away from Q's anchor, while the width of p(x) around the 
anchor is more narrow than that of p(x); cf. with Fig. 4. To 
illustrate these points analytically, we return to (29, 24, 24) that for 

v-pzzvQ and m-pxniQ predict m ~— m-p = - (mQ — my): for 

£>1, Vs anchor drifts away from Q's anchor. 

Likewise, whenever the two anchors are equal, mp = mQ, 
inequality (27) is reversed in the boomerang regime (47). 

Let us now consider the impact of the presentation order under 
this settings. We saw that for 0<£< 1 the model predicts recency 
effect. For 1 < t we expect the recency effect is still effective as 
implied by the argument (39). However, the situation changes 
drastically for £ sufficiently larger than 1, as indicated in Fig. 7. 
Now the primacy effect dominates, i.e. instead of (35) we get the 
opposite inequality. Fig. 7 also shows that interaction with two 
contradicting opinions (in the boomerang regime) enforces the 
initial anchor of V . 

To understand the primacy-recency effect analytically, consider 
the example (36), and recall that V interacts first with Q and then 
with Q! with the same parameter £. The resulting opinion p(q,q') 
of V reads: 



Fig. 8 shows how p\(q,q') = — — — - behaves as a function of £. 

The recency effect holds for £ < 2 + V2; for e > 2 + V2 we get 
primacy. Similar results are obtained for initially Gaussian 
opinions. 

Thus, in the present model, the primacy effect (relevance of the 
first opinion) can be related to the boomerang effect. 

We now examine the emergence of cognitive dissonance in the 
boomerang regime £ > 1 . Our results indicate that in this regime 
the agent is more susceptible to cognitive dissonance; cf. Fig. 6 
with Figs. 1. The mechanism of the increased susceptibility is 
explained in Fig. 6: P's opinion splits easier, since the probability 
mass moves away (in different directions) from the anchor of Q. 

Let us now assume that V repeatedly interacts with the same 
opinion of Q [cf. (42)]: 



ln + l] (x)= \/gg) + -QgWj (52) 

Jdx'V>M(x') \epW(x') + (\ -e)q{x')\ ' 

where n=l,2,... is the discrete time. Starting from initially 
Gaussian opinion, V develops two well-separated peaks, which is 
another manifestation of cognitive dissonance: the smaller peak 
moves towards the anchor of Q and finally places itself within the 
acceptance latitude of Q, where the larger peak becomes more 
narrow and drifts away from q(x); see Fig. 9. After many iterations 
( ~ 1 0 for parameters of Fig. 9) the larger peak places itself within 
the rejection latitude of Q, at which point p^ n \x) stops changing 
(stationary opinion). The above scenario suggests that in the 
boomerang regime there is a finite probability that the target agent 
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Figure 9. Repeated persuasion in the boomerang regime. Blue 
(resp. purple) curve: the initial opinion of agent V (resp. Q) described by 
probability density p(x) (resp. q(xj) as given by (17) with m-p = 0, 
\>p = Vq = 1, Big = 1. Olive curve: the opinion of V after 50 iterations (52) 
with t = 2. 

doi:1 0.1 371 /journal.pone.0099557.g009 

will eventually be persuaded after repeated exposure to the same 
opinion. 

Let us mention an experimental work that is relevant to our 
discussion above. Ref. [58] carried out experiments with subjects 
displaying boomerang effect, where each subject was exposed to 
sufficiently many different (but still similar) persuasive opinions. It 
was found that, sooner or later, the subjects exit the boomerang 
regime, i.e. they start to follow the persuasion [58]. Our set-up is 
somewhat different in that the subject (P) is repeatedly exposed to 
the same persuading opinion. Modulo this difference, our 
conclusion is similar to the experimental finding: the agent starts 
following the persuasion with a certain probability. 

Discussion 

We presented a new model for opinion revision in the presence 
of confirmation bias. The model has three inputs: the subjective 
probabilistic opinions of the target agent V and a persuading 
(advising) agent Q, and the weight of Q as perceived by V. 

The basic idea of the opinion revision rule is that no opinion 
change is expected if the persuasion is either too far or too close to 
the already existing opinion [15,36,60]. The opinion revision rule 
is not Bayesian, because the standard Bayesian approach does not 
apply to processes of persuasion and advising; see the second 
section for more details. 

The model accounts for several key empirical observations 
reported in social psychology and quantitatively interpreted within 
the social judgment theory. In particular, the model allows to 
formalize the concept of opinion latitudes, explains the structure of 
the weighted average approach to opinion formation, and relates 
the initial discrepancy (between the opinions of V and Q) to the 
magnitude of the opinion change (shown by V). In all these cases 
our model extends and clarifies previous empiric results, e.g. it 
elucidates the difference between monotonic and non-monotonic 
change-discrepancy relations, identifies conditions under which 
the opinion change is sudden, as well as provides a deeper 
perspective on the weighted average approach. 

New effects predicted by the model are summarized as follows. 

(i) For the order of presentation set-up (and outside of the 
boomerang regime) the model displays recency effect. We 
suggested that the standard argument that relates confirmation 



bias to the primacy effect does not work in this model. In this 
context we recall a widespread viewpoint that both recency and 
primacy relate to (normative) irrationality; see e.g. [13]. However, 
the information which came later is generally more relevant for 
predicting future. Hence recency can be more rational than 
primacy. 

In many experimental set-ups the recency changes to primacy 
upon increasing the retention time; see e.g. [70]. Our model 
demonstrates the primacy effect only in the boomerang regime (i.e. 
only in the special case). Hence, in future it needs to be extended 
by involving additional mechanisms, e.g. those related to "long- 
term memory" processes which could be responsible for the above 
experimental fact. Recall in this context there are several other 
theoretical approaches that address the primacy-recency difference 
[11,34,42,56,66]. 

(ii) The model can be used to describe the phenomenon of 
cognitive dissonance and to formalize the main scenario of its 
emergence. 

(in) Repeated persuasions display several features implying 
monotonous change of the target opinion towards the persuading 
opinion. However, the opinion changes do not obey the law of 
diminishing returns, or in other words, the first persuasion is not 
always leads to the largest change. These findings may contribute 
to better understanding the widespread use of repeated persua- 
sions. 

(w) We proposed that the boomerang effect is related to the limit 
of this model, where the credibility of persuasion is (very) low. A 
straightforward implementation of this assumption led us to a 
revision rule that does describe several key observational features 
of the boomerang effect and predicts new ones; e.g. that in the 
boomerang regime the agent can be prone to primacy effect and to 
cognitive dissonance. There are, however, several open problems 
with the opinion revision rule in the boomerang regime. They 
should motivate future developments of this model. One problem 
concerns relations of the revision rule with signed measures that at 
a preliminary level were outlined in section V of File S 1 . Another 
problem is that the revision rule in the boomerang regime (and 
only there) is not completely smooth, since it includes the function 
\x\, whose second derivative is singular. We do hope to clarify 
these points in future. 

In this paper we restricted ourselves by studying few (two or 
three) interacting agents with opinions described via subjective 
probabilities. However, these probabilities can also represent an 
ensemble of agents each one having a fixed (single) opinion, a 
useful viewpoint on subjective probabilities advocated in Ref. [37]. 
In future we plan to explore this point and also address the opinion 
dynamics for collectives of agents. This last aspect was recently 
extensively studied via methods of statistical physics; see [20,63] 
for reviews. 
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